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Recent studies in Kenya and Ghana have shown that individuals who inherit two malaria-protective genetic disorders of 
haemoglobin — cc + thalassaemia and sickle cell trait — experience a much lower level of malaria protection than those who in- 
herit sickle cell trait alone. We have previously demonstrated that this can limit the frequency of cc + thalassaemia in a population 
in which sickle cell is present, which may account for the frequency of « + thalassaemia in sub-Saharan Africa not exceeding 50%. 
Here we consider the relationship between a + thalassaemia and sickle cell in South Asian populations, and show that very high 
levels of a + thalassaemia combined with varying levels of malaria selection can explain why sickle cell has penetrated certain 
South Asian populations but not others. 
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It is widely accepted that the high frequencies of genetic blood 
disorders (haemoglobinopathies) seen in almost all old-world 
malarious regions are the result of malaria selection (Haldane 
1949; Flint et al. 1998). The mutation responsible for sickle 
cell anaemia (|3 S ) serves as a canonical example: the heterozy- 
gous condition (sickle cell trait) offers close to 90% protection 
against severe Plasmodium falciparum syndromes (Allison 1964; 
Hill et al. 1991; Williams et al. 2005a); the homozygous con- 
dition causes lethal sickle cell disease. The inactivation of one 
of the pair of alpha globin genes on chromosome 16 (a + thalas- 
saemia) also offers substantial malaria protection (Williams et al. 
2005b) but only causes a mild blood disorder in its homozygous 
state. 

Re-use of this article is permitted in accordance with the Terms and 
Conditions set out at http://wileyonlinelibrary.eom/onlineopen# 
OnlineOpen_Terms 



Recent studies have revealed an additional dimension to the 
relationship between f5 s and a + . A cohort study of 2104 children 
living in Kilifi, Kenya (Williams et al. 2005c) found that malaria 
protection was reduced to only 10% in children who inherited both 
sickle cell trait and homozygous a + . May et al. (2007) were able to 
extend this observation with their finding that Ghanaian children 
with sickle cell trait and heterozygous a + thalassaemia enjoy a 
lower degree of malaria protection than children with sickle cell 
trait alone (the odds ratio for severe malaria disease was 0.06 for 
sickle cell trait alone, increasing to 0.12 in the presence of ot + 
thalassaemia). Because sickle cell trait is caused by a mutation in 
beta globin, and a + thalassaemia by a mutation in alpha globin, 
this is a clear example of epistasis (here defined as the presence of 
a particular allele at one locus affecting the phenotypic outcome 
of an allele at a second locus). 

The mutation responsible for a + thalassaemia is found 
in nearly all old-world malarious regions (Flint et al. 1998; 
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Weatherall and Clegg 2001a). However, its frequency in sub- 
Saharan Africa does not exceed 50%, despite the intense malaria 
selection present and the relatively benign nature of homozygous 
c* + . We know that ct + is capable of reaching higher population fre- 
quencies elsewhere in the world: in Melanasia it can climb to 68% 
(Flint et al. 1986). In Williams et al. (2005c) we used a mathemati- 
cal model to demonstrate that a cancellation of malaria protection 
when p s is inherited alongside a + could account for why a + 
frequencies have been capped in sub-Saharan Africa. In Melana- 
sia, a lack of fl s has allowed a + to climb higher. 

Negative epistasis between alpha thalassaemia and sickle 
cell trait is not the only example of epistasis among the malaria- 
protective haemoglobinopathies. Alpha thalassaemia can interact 
with beta thalassaemia to result in a milder clinical phenotype than 
would have been seen with beta thalassaemia alone (Weatherall 
and Clegg 2001b). In a separate modelling exercise (Penman 
et al. 2009), we considered both positive and negative epistatic 
interactions together in the context of Mediterranean populations, 
and argued that positive epistasis in particular could have helped 
a combination of alpha and beta thalassaemia to keep the highly 
malaria-protective f$ s out of much of the Mediterranean region. 
This led us to the assertion that different patterns of malaria- 
protective haemoglobinopathies in different populations may be 
partly maintained by interactions among the genes themselves. 

In the Middle East, where sickle cell and a + thalassaemia 
coexist, the pattern does not appear to be too different from 
that seen in sub-Saharan Africa. In Saudi Arabia, for exam- 
ple, the highest reported frequency of a + thalassaemia is 0.55 
(El-Hazmi and Warsy 1999). In South Asia, the pattern is much 
more variable. Table 1 summarizes South Asian data from tribal 
populations from three locations: Orissa (India); Andhra Pradesh 
(India), and the Terai region of India and Nepal, obtained from the 
literature and from sequencing work first reported in this paper. 
These represent populations living in malarious regions for which 
data on both sickle cell and alpha thalassaemia are available. 

As shown in Table 1 , widely varying frequencies of both a + 
and p s may be observed in these areas, but it is clear that while 
all groups possess high frequencies of alpha thalassaemia, not all 
groups possess sickle cell. The contrast is particularly striking 
for the central and Western Terai Tharu populations: Tharus in 
the Western Terai region in Nepal have both sickle cell (0.05) 
and a high frequency of a + thalassaemia (0.72), but Central Terai 
Tharus have an a + thai frequency of 0.83 and no sickle cell at 
all (Modiano et al. 1991). Indian Tharus from the Terai region of 
Uttar Pradesh have near- fixation of a + , at a frequency of 0.94 
(Sinha et al. 2009) as well as sickle cell. In this article, we ex- 
tend the modelling work carried out in Williams et al. (2005c) 
and Penman et al. (2009) to ask whether these data from South 
Asia can be reconciled with the negative epistasis between a + 
thalassaemia and sickle cell documented in Africa. 



Methods 

To explore the population genetic consequences of negative epis- 
tasis between a + thalassaemia and sickle cell trait, we considered 
a population containing nine possible genotypes constructed from 
four possible gametic types: ap\ a + p\ a|3 s , and a + p s . 

a + p7a|3 s is equivalent to ap/a + p s . The frequency of genotype 
i is given by yi . 

The rate of change of frequency of each genotype with time 
was given by: 

dyt 

— =kF(i)- + r,[L m )yi, 
at 

where 

p,, = mortality rate of genotype / in the absence of malaria, 
[i m = excess mortality that malaria exacts upon the population, 

r, = relative susceptibility to death from malaria of genotype i. 

The term k gives the total birth rate, and was calculated 
so as to keep the total population size constant (see Supporting 
information). 

F(i) apportions the total birth rate into different genotypes 
according to the frequencies of their possible constituent game- 
typic types following simple rules of panmixia (see Supporting 
information). Alpha and beta globin are encoded on chromosomes 
16 and 11, respectively, so we have modeled them as being com- 
pletely unlinked. 

We investigated our model's behavior by carrying out many 
numerical simulations, systematically sampling parameter space 
to ensure that we had found all possible stable outcomes. 

The dynamical framework presented here could be reframed 
into a more conventional population genetic model by calculating 
a life expectancy for each genotype (the inverse of its total mor- 
tality rate) and taking the ratio of the life expectancy of a given 
genotype to the longest life expectancy in the population as the 
relative fitness of that genotype. We chose to use a dynamical 
approach because the two mortality rates assigned to each geno- 
type make it obvious what we have assumed about blood disorder 
severity; malaria protection and epistasis for each genotype, rather 
than subsuming both into a single fitness estimate. This type of 
dynamical approach has been employed in previous studies of 
malaria resistance (Gupta and Hill 1995; Ruwende et al. 1995). 

The mortality rates used in this article are intended to demon- 
strate the range of possible model behaviors rather than provide 
a specific historical recreation. As can be seen in Table SI, we 
chose 0.03 years -1 as the mortality rate of normal individuals 
in the absence of death from malaria, implying an average life 
expectancy of 33 years. This seems reasonable, but is an arbitrary 
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baseline — what matters are the relative values of the mortality 
rates assigned to each genotype. If pi, = 0.03 and [L„, = 0.01, this 
implies that malaria is responsible for 25% of the total mortality 
of wild-type individuals in that population. 

Results and Discussion 

Figure 1 demonstrates that the inclusion of negative epistasis has 
a dramatic impact on the equilibrium frequencies of a + and p" s 
(as already discussed in Williams et al. 2005c). In the absence 
of epistasis, <x + always reaches fixation in the population along- 
side p s ; when epistasis is included, antagonism between a + and 
P s can limit the former's frequency. However, under negative 
epistasis the system is highly sensitive to the level of malaria 
selection. At lower levels of malaria selection, a + can become 
fixed in the population to the complete exclusion of p s ; the co- 
existence of a + and |3 S is only possible above a certain selection 
threshold. 

Figure 2 considers the possibility of pre-existing high fre- 
quencies of a + acting against |3 S . Clearly, even under strong 
malaria selection, a high frequency of a + can stop |3 S from in- 
vading the population. In Figure 2 we also consider the effect 
of introducing a cost to the a + a + genotype. Nowadays, the mild 
anaemia associated with a + a + is not regarded as a significant 
health concern, but historically even this mild anaemia may have 
led to a small increase in mortality (e.g., during childbirth), so a 
small blood disorder related cost seems plausible. Unsurprisingly, 
the higher the cost of homozygosity for a + , the harder it is for a + 
to keep p s out of a population. Figure SI illustrates how this cost 
affects the equilibrium frequencies of a + and |3 S . 

The key observation from Figures 1 and 2 is that, in the 
presence of negative epstasis, malaria selection has a nonlinear 
effect on allele frequencies of a + and p s . As shown in Figure 2, a 
small change in malaria selection pressure can move a population 
from a scenario where a + excludes p s to one where |3 S keeps a + 
in check. 

As discussed in the Introduction, and detailed in Table 1, f5 s 
coexists with a high frequency of a + thalassaemia in the Tharus 
of the Western Terai. However, Central Terai Tharus possess no 
P s , just a + thalassaemia alone. Similarly, the Munda tribe of 
Orissa possesses sickle cell, whereas the Oraon do not — but both 
have high frequencies of a + . Figure 2 illustrates that a change in 
malaria selection pressure could allow f$ s to invade an ct + rich 
population where previously it was excluded. Negative epistasis 
between ct + and p s combined with varying malaria selection thus 
offers a potential explanation for the heterogeneity in sickle cell's 
distribution among the Tharu. 

Figure 3 provides a time series to illustrate this point. Panels 
(a) and (b) illustrate two populations that are subject to a high 
level of malaria selection, and accumulate a high frequency of 



3628 



EVOLUTION DECEMBER 2011 



BRIEF COMMUNICATION 





1.00 - 




0.90 - 


u 


0.80 






(V 






0.70 


at 




< 


0.60 






cu 






0.50 


t 






0.40 






-Q 






0.30 


'5 




D" 




LU 


0.20 




0.10 ■ 




o.oo - 



»OOOCKXKDCCCOCOOOaXi 



A 




no epistasis 


O 


a + 


no epistasis 


▲ 


3 s 


with epistasis 


• 


a* 


with epistasis 



A A 



A A 



AAA Ai AA AAAAAiAAA 



0.10 



0.20 



0.30 



0.40 



0.50 



Proportion of total wild type mortality due to malaria 

Figure 1. This figure shows how the equilibrium frequencies of a + and (5 s change with differing levels of malaria selection, with and 
without negative epistasis. These equilibrium frequencies are obtained after 150,000 years; a+ and |3 S both started out at a frequency of 
0.001 in the population. The blood disorder mortality rates and relative susceptibility to death from malaria for each genotype are the 
unbracketed figures given in Table S1 (figures in italics were used in the "no epistasis" scenario). 



ot + . After 2500 years, the p s allele arrives in both populations. 
In population (a), this coincides with an increase in the level 
of malaria selection, and f$ s is kept out. In population (b), this 
coincides with a drop in the level of malaria selection and p s is 
allowed in. The fact that an increase in malaria selection keeps 
P s out is related to the properties we assigned the a + allele in 
this particular example: as can be seen from Figure 2, there are 
some regions where an upwards shift in malaria selection will act 
against p s invasion, and others where an upwards shift in malaria 
selection will favor p s invasion. 

The very high frequencies of alpha thalassaemia seen along- 
side sickle cell in the tribal groups of Andhra Pradesh present 
a conundrum, in that there is no evidence for sickle cell being 
excluded from any population, and the sickle cell frequencies 
observed are very high (Table 1). Under the negative epistasis 
model, we have to explain high frequencies of alpha thalassaemia 
coexisting with high frequencies of sickle cell as being far from 
equilibrium: this could be due to rapid changes in malaria se- 
lection pressure, or the effects of population admixture. In the 
specific case of the Koya Dora of AP, it might be significant that 
10% of alpha thalassaemia in this population is due to a unique 
mutant of the normal a chain termination codon, dubbed HbKD 
(De Jong et al. 1975). This mutation leads to an elongation of the 
alpha globin subunit, similar to that caused by Haemoglobin Con- 
stant Spring (Weatherall and Clegg 2001b), and it is conceivable 
that this elongated alpha globin interacts with sickle beta globin 
in a way that avoids negative epistasis. 



The majority of alpha thalassaemia in India, however, is 
caused by the same types of deletion as are found in Africa. 
Unknown effects of HbKD aside, the most likely mechanism 
behind the negative epistasis observed in Kenya ought to ap- 
ply equally in India. All sickle haemoglobin is formed of 2 alpha 
globin and 2 mutated sickle beta globin subunits. The electrostatic 
properties of the globin subunits mean that alpha globin partners 
preferentially with normal beta globin over sickle beta globin 
(Bunn 1987). In erythrocytes with both sickle cell trait and a + 
thalassaemia, there will be a limited supply of alpha globin; thus, 
a red blood cell with sickle cell trait and alpha thalassaemia will 
have a lower intracellular concentration of sickle haemoglobin 
than a red blood cell with sickle cell trait alone. Williams proposes 
that the malaria-protective properties of sickle cell trait must rely 
upon the intracellular concentration of sickle haemoglobin — so 
accounting for the loss of malaria protection in sickle cell-a + 
thalassaemic erythrocytes. 

The diversity of different mutations present in populations 
can sometimes be used to gauge the degree of population admix- 
ture, or trace migration events that have occurred. In the case of 
sickle cell, almost all sickle cell in India occurs on the same beta 
globin haplotype, indicating a spread from a common origin (Flint 
et al. 1998). Flint et al. note in their review that the mutation may 
have arisen in or been imported into a single Indian population, 
which then became dispersed following invasions from the North 
at some point during the last 5000 years. This could certainly 
account for some of the heterogeneity in the distribution of sickle 
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Figure 2. The balance of power between a + and FJ S . Panels (A- 
C) illustrate the frequency of a + required to prevent sickle cell 
successfully invading the population, where negative epistasis is 
present. In panel (A), homozygous a + thalassaemia carries no cost 
(mortality rate — 0.03 years -1 ). In panel (B), it has been assigned 
a mortality rate of 0.031 years -1 , and in panel (C) it has been as- 
signed a mortality rate of 0.032 years -1 . All other mortality rates 
are as in Table S1, and include negative epistasis. (3 s is given an 
initial frequency of 0.001 in all cases, and "prevention of inva- 
sion" is defined as (3 s being at a frequency below 0.00005 after 
50000 years. 

cell in South Asia, but cannot explain why some Tharus have it 
but others do not. 

In terms of alpha thalassaemia: table one makes clear that two 
types of deletion are responsible for most alpha thalassaemia in 



these populations (-a 3 ' 7 and -a 4 ' 2 ). The -a 3 ' 7 and -a 4 2 deletions 
arise through unequal recombination; are responsible for most al- 
pha thalassaemia worldwide (Weatherall and Clegg 2001b), and 
have indistinguishable phenotypic effects (Williams et al. 1996). 
The predominance of the -a 3 7 deletion in the Tharu may indicate 
especially strong malaria selection (leading to the rapid spread of 
the first deletion that occurred), or perhaps a lack of population 
admixture — but further modelling work is necessary to investi- 
gate exactly how different recombination rates, mutation rates, 
selection levels, and migration between populations interact to 
determine alpha thalassaemic diversity. 

As noted in the Introduction, we have already argued that 
a combination of alpha and beta thalassaemia may be helping 
to exclude sickle cell from much of the Mediterranean (Penman 
et al. 2009). Alpha thalassaemia frequencies in the Mediterranean 
are lower than in the populations considered here, and we did not 
consider exclusion by alpha thalassaemia alone a robust explana- 
tion for the rarity of sickle cell in Greece and Cyprus, but positive 
epistasis between alpha and beta thalassaemia could have assisted 
beta thalassaemia in outcompeting sickle cell. In addition to beta 
thalassaemia in the Mediterranean, sickle cell competes with beta 
thalassaemia and the structural variant haemoglobin C (HbC) in 
West Africa (Livingstone 1976; Hedrick 2004; Modiano et al. 
2007), and mutual exclusion appears to occur between p" s and 
haemoglobin E (HbE), in Asia (Weatherall and Clegg 2001a, see 
Figure 2). Could competition with another beta globin variant be 
responsible for some of the patchiness in sickle cell's distribution 
in India and Nepal? 

Modiano et al. (1991) observed a low frequency (0.02) of 
beta thalassaemia in Central Terai Tharus. It is conceivable that 
this small amount of beta thalassaemia acts synergistically with 
alpha thalassaemia to exclude sickle cell from this population, 
as we suggested in the context of the Mediterranean in Penman 
et al. (2009), but it is just as possible that alpha thalassaemia 
kept sickle cell out, allowing beta thalassaemia to appear later. 
Unfortunately, Modiano et al. (1991) did not record the frequency 
of beta thalassaemia in the Western Terai Tharu population — 
further studies of Tharu populations may help to clarify this point. 

Varying levels of beta thalassaemia occur in the tribal popu- 
lations of Orissa (Balgir 2006a,b). Of the populations we consider 
in Table 1 , the Munda were reported as having a beta thalassaemia 
allele frequency of 0.026, and the Oraon a frequency of 0.009. 
Balgir observed a nonsignificant negative correlation between |3 S 
and beta thalassaemia frequencies. It seems likely that p s and 
beta thalassaemia compete in this region, but the relationship be- 
tween them is far from straightforward. The results we present 
here suggest that to understand heterogeneity in the distribution 
of f5 s fully, we must consider alpha as well as beta thalassaemia. 

Prior to Williams et al. (2005c), a different form of epistatic 
interaction between a + and p s had been suggested: namely that 



3630 



EVOLUTION DECEMBER 2011 



BRIEF COMMUNICATION 



B 





1 




0.9 




0.8 


u 


0.7 






cu 




cr 


0.6 


CD 








4 — 

CD 


0.5 


_CD 




< 


0.4 




0.3 




0.2 




0.1 




0 



0 2000 4000 6000 8000 10000 12000 

Time (years) 



l 

0.9 
0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0 



a+ frequency 

BSfrequency 



0 2000 4000 6000 8000 10000 12000 

Time (years) 



Figure 3. The behavior of a + and |3 S , with epistasis and varying malaria selection levels. Panels (A) and (B) indicate how the equilibrium 
frequencies of a + and p s change over time. In both panels, the initial frequency of a + was 0.01 at time 0 and malaria was responsible 
for 19% of the wild-type mortality over the first 2500 years. After 2500 years, p s was introduced at a frequency of 0.001, and the levels 
of malaria selection changed as follows: in panel (A) malaria became responsible for 27% of total wild-type mortality, and in panel 
(B) malaria became responsible for 14% of the total wild-type mortality. In these scenarios, we assumed there was a slight cost to the 
a + a + phenotype (it was assigned a mortality rate of 0.031 years -1 compared to the wild type 0.03 years -1 ). 



a-thalassaemia may be able to ameliorate some of the adverse 
symptoms of sickle cell anaemia. The proposed mechanism for 
this effect once again hinges on a limited pool of a-globin lead- 
ing to a lower concentration of sickle haemoglobin in red blood 
cells. In the case of sickle cell anaemia, there is only sickle beta 
globin available — no normal beta globin — but some of the alpha 
globin binds to the very small amount of delta globin available in 
erythrocytes, and when alpha globin is limited this becomes more 
important. Although it is clear that alpha thalassaemia can change 
the hematological profile of sickle cell disease, it is less clear that 
this has any beneficial effect in terms of increased survival (see 
Weatherall and Clegg 2001b, especially table 11.8 on p. 523). 
Nevertheless, in the Supporting information, we investigate how 
such an effect might interact with negative epistasis canceling 
malaria protection for sickle heterozygotes. It does not seem to 
alter the overall pattern (Fig. S2). 

The strikingly different haemoglobinopathy patterns ob- 
served in different world regions each represent an alternative 
evolutionary answer to the problem of malaria. It is starting to 
become clear that the success or failure of particular beta globin 
mutations may be affected by the presence or absence of alpha 
globin mutations, and vice versa. Here we have presented a de- 
tailed exploration of the range of behaviors allowed when ot + and 
P s interact via negative epistasis. Small changes in malaria selec- 



tion pressure can dramatically alter the prospects of sickle cell 
invading a population. Such effects could account for some of the 
variation in sickle cell frequency among specific populations in 
India and Nepal. 
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The following supporting information is available for this article: 

Figure SI. The effect of varying the severity of a + thalassaemia on the equilibrium frequencies of a + and |3 S . 
Figure S2. Allowing a + thalassaemia to alleviate sickle cell anaemia. 
Table SI. Mortality rates used to generate the figures. 

Supporting Information may be found in the online version of this article. 
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